搜索资源列表
gmeans
- gmeans-- Clustering with first variation and splitting 文本聚类算法Gmeans ,使用了3种相似度函数,cosine,euclidean ,KL.文本数据使用的是稀疏矩阵形式. -gmeans clustering with first variation and splitting Gmeans,a text clustering algorithm, uses 3 functions,cosine,euclidean and
classifier-1.12
- 能对从Google中搜索出来的文本进行聚类,提供了Java包,及调用源代码.-can right from the Google Search for the text clustering, a Java package, source code and call.
main.计算文本之间相似度的程序
- 计算文本之间相似度的程序,用于文本的聚类。是在已知各个文本的文本特征向量基础上进行计算的,利用余弦值计算,Calculation of similarity between the text of the procedures for text clustering. Are known at all the text of the text feature vector calculated based on the use of cosine values
DATA
- 文本聚类分类数据集 包括20newsgroup 和retuers 中抽取的500条数据,有四个表-Text clustering and classification of data sets including 20newsgroup retuers 500 extracted data, there are four tables
WawaTextCluster
- 文本聚类算法源码-Source text clustering algorithm
Kmeans
- 基于opencv的kmeans聚类实现 输入文本数据,进行聚类输出-Opencv-based clustering of kmeans the input text data, clustering output
0010
- 基于WEKA平台的文本聚类及实现,以及常用的文本聚类效果评价指标-Text clustering based on WEKA platform and implementation, as well as common text clustering validity
AP
- 最新AP聚类算法以及演示程序,算法内容参照affinity appropagation in science。-AP latest clustering algorithm as well as the demo program, algorithm reference content affinity appropagation in science.
TDHCursorFactory
- 是一个使用Perl语言编写的一个开源文本挖掘的程序。其中涉及了众多文本挖掘的技术,如文本聚类、分词、索引,搜索引擎、字典等等。-Perl is a language to use an open source text mining process. Involving a large number of text mining technologies, such as text clustering, segmentation, indexing, search engines, dicti
pj
- 一种基于语义内积空间模型的文本聚类算法,这种算法有很好的效率,拿出来和大家分享下-A semantic-based model of inner product space the text clustering algorithm, this algorithm has good efficiency, and to share out under the
CheckText
- 实现文本相似度检查的文本层次聚类算法和划分算法的C#源码-Text to check the text similarity hierarchical clustering algorithm and classification algorithm C# source code
textcluster
- java文本聚类程序代码文件,实现文本聚类功能,分词。-text clustering java code files to achieve text clustering features, sub-word.
1
- 关于PSO优化后文本聚类的算法--MATLAB-Text on the PSO clustering algorithm optimized- MATLAB
Clustering
- Frequent Term Based Text Clustering
TextClusteringKmeans
- 从文本文件读入文本,分词,去停顿词,然后利用kmeans进行文本聚类-Text Clustering with K means
1
- 基于WEKA平台的文本聚类研究与实现 文本聚类是文本挖掘领域的一个重要研究分支,是聚类方法在文本处理领域的应用。本文对基于空间向量模型的文本聚类过程做了较深入的讨论和总结,利用文本语料库,基于数据挖掘工具研究并实现了文本聚类的过程。本文首先给出了文本聚类的思想和过程,回顾了文本聚类领域的已有成果,列举了文本聚类领域在特征表示、特征提取等方面的基础研究工作。另外,本文回顾了现有的文本聚类算法,以及常用的文本聚类效果评价指标。在研究了已有成果的基础上,本文利用20 Newsgroup文本语料库,
ClassText
- Several Clustering and classification methods for text documents
Text-Clustering-Doc
- Text Document Clustering Document for the MCA and Mtech Students for the final year projects.This Document Clustering is based on several algorithms and full document is presented here.
Text-clustering
- 机器学习中文本聚类算法,里面有5个文件,包含Python实现代码和测试数据。-The clustering algorithm machine learning Chinese, there are five files that contain Python implementation code and test data.
DBSCAN Clustering
- 基于matlab的dbscancluster的实现可用于文本聚类(The implementation of dbscancluster based on Matlab can be used for text clustering)